A Conversational Model of Multimodal Interaction
نویسندگان
چکیده
Multimodal interaction is employed in a variety of contemporary information systems in order to enhance the flexibility and naturalness of the user interface. In this paper, we propose a comprehensive framework which allows the modeling and specification of multimodal interactions. To this end, we employ an extended notion of ‘dialogue acts’ which can be performed by linguistic or non-linguistic means. On this basis we discuss the temporal structure of multimodal interaction. First, a comprehensive set of constraints is presented that describes all patterns of exchange which can be encountered during a cooperative information-seeking dialogue. Second, we introduce a strategic level of description, which allows the specification of the topical structure of the dialogue according to a selected information-seeking strategy. The model was used to design and implement the MERIT system (Multimedia Extensions to Retrieval Interaction Tools), and led to a reduction in the complexity of the user interface while preserving most of the useful, but sometimes confusing, dialogue options of advanced direct manipulative interfaces. An abbreviated version is published in: Proc. of the 11th National Conference on Artificial Intelligence (AAAI ’93), Washington DC, USA, July 11-16, 1993. Menlo Park: AAAI Press/ The MIT Press, 1993, pp. 283-288. Also available as: Arbeitspapiere der GMD No. 741, Sankt Augustin: GMD, March 1993. Address all correspondence to: Dr. Adelheit Stein, Dr. Ulrich Thiel, GMD-IPSI, Dolivostraße 15, D-64293 Darmstadt E-mail: [email protected] Phone: ++ 49 / 6151 / 869-841 2 A Conversational Model of Multimodal Interaction Stein, Thiel
منابع مشابه
Physically embodied conversational agents as health and fitness companions
We present a physical multimodal conversational Companion in the area of health and fitness. Conversational spoken dialogues using physical agents provide a potential interface for applications which are aimed at motivating and supporting users. Open source software called jNabServer, which enables spoken and multimodal interaction with Nabaztag/tag wireless rabbits, is presented together with ...
متن کاملUso de Canales de Comunicación Adicionales en los Sistemas Conversacionales
In recent years there has been an increasing interest concerning the integration of speech and other communication media in conversational systems, originating the so-called multimodal conversational systems. When several interaction modalities are available, users can receive more feedback from a conversational system and it can also receive more information from users, leading to a reduction ...
متن کاملStructuring collaborative information-seeking dialogues
Conversational approaches to human-computer collaboration have so far mostly been employed for the design of natural language interfaces. We claim, however, that our “conversational interaction model” can also be applied feasibly to graphical and multimodal interactions. The model comprises two interrelated parts: first, the description of local discourse structures and functional interrelation...
متن کاملModeling and Guiding Cooperative Multimodal Dialogues
In this paper we claim that a consistent conversational approach to human-computer interaction can be applied feasibly to multimodal interaction. A comprehensive conversational model is presented that covers interrelated levels of the dialogue structure, i.e. illocutionary, rhetorical, and topical aspects. It thus provides the basis for a consistent interpretation of the linguistic as well as g...
متن کاملZooming on Multimodality and Attuning: A Multilayer Model for the Analysis of the Vocal Act in Conversational Interactions
The most recent research about both human-human conversational interaction and human-computer agents conversational interaction is marked by a multimodal perspective. On the one hand this approach underlines the cooccurrence and synergy between different languages and channels, on the other hand it highlights the need for joined and coordinated action between various subjects (attuning and mutu...
متن کاملSemantics-based Representation for Multimodal Interpretation in Conversational Systems
To support context-based multimodal interpretation in conversational systems, we have developed a semantics-based representation to capture salient information from user inputs and the overall conversation. In particular, we present three unique characteristics: fine-grained semantic models, flexible composition of feature structures, and consistent representation at multiple levels. This repre...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1993